BEERL: Both Ends Explanations for Reinforcement Learning

نویسندگان

چکیده

Deep Reinforcement Learning (RL) is a black-box method and hard to understand because the agent employs neural network (NN). To explain behavior decisions made by agent, different eXplainable RL (XRL) methods are developed; for example, feature importance applied analyze contribution of input side model, reward decomposition components output end model. In this study, we present novel connect explanations from both ends which results in fine-grained explanations. Our exposes prioritization user, turn generates two levels explanation allows reconfigurations when unwanted behaviors observed. The further summarizes detailed into focus value that takes account all quantifies fulfillment desired properties. We evaluated our applying it remote electrical telecom-antenna-tilt use case openAI gym environments: lunar lander cartpole. demonstrated detailing features’ contributions certain rewards revealed biases components, then addressed adjusting reward’s weights.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical Functional Concepts for Knowledge Transfer among Reinforcement Learning Agents

This article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for Reinforcement Learning agents. These definitions are used as a tool of knowledge transfer among agents. The agents are assumed to be heterogeneous; they have different state spaces but share a same dynamic, reward and action space. In other words, the agents are assumed t...

متن کامل

The spindle plays both ends

In order to correctly segregate during mitosis, sister chromatids must attach their kinetochores to stable microtubule (MT) bundles, known as K-fibers, that are connected to opposite spindle poles. In 1986, Marc Kirschner and Tim Mitchison proposed a “search and capture” model in which dynamic MTs emanating from the centrosomes would be selectively stabilized if their plus ends happened to cont...

متن کامل

An Adaptive Learning Game for Autistic Children using Reinforcement Learning and Fuzzy Logic

This paper, presents an adapted serious game for rating social ability in children with autism spectrum disorder (ASD). The required measurements are obtained by challenges of the proposed serious game. The proposed serious game uses reinforcement learning concepts for being adaptive. It is based on fuzzy logic to evaluate the social ability level of the children with ASD. The game adapts itsel...

متن کامل

Dopamine: Burning the Candle at Both Ends

Dopamine neurons are well known for signaling reward-prediction errors. In this issue, Matsumoto and Takada (2013) show that some dopamine neurons also signal salient events during progression through a visual search task requiring working memory and sustained attention.

متن کامل

Spindle Microtubules: Getting Attached at Both Ends

A recent study describes a novel role for the centrosomal protein Cep57 in attaching spindle microtubules to both kinetochores and centrosomes, suggesting similar mechanisms may be used for generating these two distinct linkages in mitosis.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2022

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app122110947